Overview

Dataset statistics

Number of variables: 17
Number of observations: 5134
Missing cells: 38156
Missing cells (%): 43.7%
Duplicate rows: 1
Duplicate rows (%): < 0.1%
Total size in memory: 2.3 MiB
Average record size in memory: 473.9 B
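The percentages in this block can be re-derived from the raw counts. The sketch below is only a consistency check on the figures printed above; the variable names are illustrative, and the size estimate assumes that total size is roughly the average record size multiplied by the number of rows.

```python
# Figures copied from the "Dataset statistics" block of this report.
n_variables = 17
n_observations = 5134
missing_cells = 38156
avg_record_bytes = 473.9

# Every row has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: total size ~ average record size x number of rows.
total_mib = avg_record_bytes * n_observations / 2**20

print(f"Total cells: {total_cells}")
print(f"Missing cells: {missing_pct:.1f}%")
print(f"Approximate size in memory: {total_mib:.1f} MiB")
```

Both derived values round to the figures shown in the report (43.7% and 2.3 MiB).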

Variable types

NUM: 12
CAT: 5

Reproduction

Analysis started: 2020-06-09 20:32:15.518738
Analysis finished: 2020-06-09 20:32:35.017316
Duration: 19.5 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

referenceDate has constant value "2020-06-09" Constant
Dataset has 1 (< 0.1%) duplicate rows Duplicates
regionId has a high cardinality: 5132 distinct values High cardinality
label has a high cardinality: 5132 distinct values High cardinality
lastUpdatedDate has a high cardinality: 77 distinct values High cardinality
dataSource has a high cardinality: 158 distinct values High cardinality
totalConfirmedCases is highly correlated with totalDeaths and 7 other fields High correlation
totalDeaths is highly correlated with totalConfirmedCases and 6 other fields High correlation
totalRecoveredCases is highly correlated with totalConfirmedCases High correlation
numPositiveTests is highly correlated with totalDeaths and 6 other fields High correlation
numDeaths is highly correlated with totalDeaths and 6 other fields High correlation
numRecoveredCases is highly correlated with totalTestedCases and 1 other fields High correlation
totalTestedCases is highly correlated with numRecoveredCases and 1 other fields High correlation
diffNumPositiveTests is highly correlated with totalDeaths and 6 other fields High correlation
avgWeeklyDeaths is highly correlated with totalDeaths and 6 other fields High correlation
avgWeeklyConfirmedCases is highly correlated with totalDeaths and 6 other fields High correlation
avgWeeklyRecoveredCases is highly correlated with totalDeaths and 9 other fields High correlation
diffNumDeaths is highly correlated with avgWeeklyRecoveredCases High correlation
lastUpdatedDate has 1809 (35.2%) missing values Missing
totalDeaths has 1899 (37.0%) missing values Missing
totalRecoveredCases has 4040 (78.7%) missing values Missing
totalTestedCases has 3301 (64.3%) missing values Missing
numPositiveTests has 3535 (68.9%) missing values Missing
numDeaths has 4551 (88.6%) missing values Missing
numRecoveredCases has 4881 (95.1%) missing values Missing
diffNumPositiveTests has 3514 (68.4%) missing values Missing
diffNumDeaths has 4549 (88.6%) missing values Missing
avgWeeklyDeaths has 1948 (37.9%) missing values Missing
avgWeeklyRecoveredCases has 4057 (79.0%) missing values Missing
totalDeaths is highly skewed (γ1 = 48.68424864) Skewed
totalConfirmedCases is highly skewed (γ1 = 62.5309263) Skewed
totalTestedCases is highly skewed (γ1 = 31.76561006) Skewed
numPositiveTests is highly skewed (γ1 = 35.95050739) Skewed
numDeaths is highly skewed (γ1 = 20.97331424) Skewed
diffNumPositiveTests is highly skewed (γ1 = -38.29243544) Skewed
avgWeeklyDeaths is highly skewed (γ1 = 48.28546627) Skewed
avgWeeklyConfirmedCases is highly skewed (γ1 = 62.85280488) Skewed
avgWeeklyRecoveredCases is highly skewed (γ1 = 31.09397052) Skewed
regionId is uniformly distributed Uniform
label is uniformly distributed Uniform
totalDeaths has 989 (19.3%) zeros Zeros
totalConfirmedCases has 109 (2.1%) zeros Zeros
totalRecoveredCases has 53 (1.0%) zeros Zeros
numPositiveTests has 1296 (25.2%) zeros Zeros
numDeaths has 419 (8.2%) zeros Zeros
numRecoveredCases has 86 (1.7%) zeros Zeros
diffNumPositiveTests has 855 (16.7%) zeros Zeros
diffNumDeaths has 425 (8.3%) zeros Zeros
avgWeeklyDeaths has 2121 (41.3%) zeros Zeros
avgWeeklyConfirmedCases has 1169 (22.8%) zeros Zeros
avgWeeklyRecoveredCases has 306 (6.0%) zeros Zeros
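The Skewed warnings above quote γ1, the skewness coefficient of each column. As a rough illustration of what that number measures, the sketch below computes the population moment coefficient g1 = m3 / m2^1.5 in plain Python. It is an assumption that this matches the report only approximately: the report's own estimator may apply a small-sample bias correction, and the function name and toy data are hypothetical.

```python
import math

def skewness(values):
    """Population moment coefficient of skewness, g1 = m3 / m2**1.5."""
    data = [v for v in values if v is not None]  # skip missing entries
    n = len(data)
    mean = sum(data) / n
    m2 = sum((v - mean) ** 2 for v in data) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in data) / n  # third central moment
    if m2 == 0:
        return 0.0  # constant column; returning 0.0 is a choice made here
    return m3 / m2 ** 1.5

# Toy column (not taken from the dataset): mostly small counts, one huge value.
toy = [0, 0, 1, 2, 2, 3, None, 5000]
print(skewness(toy))        # strongly positive: long right tail
print(skewness([1, 2, 3]))  # symmetric data gives 0.0
```

A single very large region next to thousands of small ones is enough to push γ1 far above zero, which is consistent with the cumulative count columns flagged above.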

Variables

regionId
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count: 5132
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 40.2 KiB

Value                                          Count  Frequency (%)
North_Macedonia                                2      < 0.1%
O%27Higgins_Region                             2      < 0.1%
Liberty_County,_Georgia                        1      < 0.1%
Saxony-Anhalt                                  1      < 0.1%
Canton_of_Fribourg                             1      < 0.1%
29436,_Cross,_South_Carolina,_South_Carolina   1      < 0.1%
60008,_Rolling_Meadows,_Illinois,_Illinois     1      < 0.1%
Madhya_Pradesh                                 1      < 0.1%
McPherson_County,_South_Dakota                 1      < 0.1%
Gardner,_Massachusetts                         1      < 0.1%
New_Marlborough,_Massachusetts                 1      < 0.1%
St._Lawrence_County,_New_York                  1      < 0.1%
Gratiot_County,_Michigan                       1      < 0.1%
Pskov_Oblast                                   1      < 0.1%
60564,_Naperville,_Illinois,_Illinois          1      < 0.1%
Dornbirn_District                              1      < 0.1%
Isle_of_Wight_County,_Virginia                 1      < 0.1%
Henry_County,_Indiana                          1      < 0.1%
Blackford_County,_Indiana                      1      < 0.1%
Ben_Hill_County,_Georgia                       1      < 0.1%
Rio_Grande_do_Sul                              1      < 0.1%
29530,_Coward,_South_Carolina,_South_Carolina  1      < 0.1%
Haralson_County,_Georgia                       1      < 0.1%
Newport_Beach,_California                      1      < 0.1%
Montcalm_County,_Michigan                      1      < 0.1%
Other values (5107)                            5107   99.5%
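Several regionId values, such as O%27Higgins_Region, look like percent-encoded identifiers with underscores in place of spaces, and the label column lists the readable form O'Higgins Region. The sketch below shows one way such an identifier could be turned into display text. It is an assumption that the two columns are related this way in general (North_Macedonia is labelled Republic of Macedonia, for example), and the helper name is hypothetical.

```python
from urllib.parse import unquote

def region_id_to_text(region_id: str) -> str:
    """Decode percent-escapes, then turn underscores into spaces."""
    return unquote(region_id).replace("_", " ")

# Two identifiers taken from the regionId table in this report.
print(region_id_to_text("O%27Higgins_Region"))
print(region_id_to_text("Liberty_County,_Georgia"))
```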

Length

Max length: 61
Median length: 24
Mean length: 24.6986755
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 69
Unique unicode categories: 8
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
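The category tables in this section can be approximated with Python's standard `unicodedata` module, which returns a two-letter general category per character (for example `Ll` for Lowercase Letter, `Lu` for Uppercase Letter, `Nd` for Decimal Number, `Pc` for Connector Punctuation, `Po` for Other Punctuation, `Pd` for Dash Punctuation). This is a minimal sketch, not the report's own implementation; the function name is illustrative, and script and block lookups are left out because the standard library does not provide them.

```python
import unicodedata
from collections import Counter

def category_counts(values):
    """Count Unicode general categories over all characters in `values`."""
    counts = Counter()
    for value in values:
        for ch in value:
            counts[unicodedata.category(ch)] += 1
    return counts

# Two regionId values taken from this report.
sample = ["O%27Higgins_Region", "Saxony-Anhalt"]
counts = category_counts(sample)
for code, n in counts.most_common():
    print(code, n)
```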

Most occurring characters

ValueCountFrequency (%) 
_111508.8%
 
o104578.2%
 
n99437.8%
 
a98077.7%
 
i79446.3%
 
t71235.6%
 
s62054.9%
 
,58914.6%
 
e57454.5%
 
l57324.5%
 
u53694.2%
 
r50804.0%
 
C46923.7%
 
y36442.9%
 
h28122.2%
 
c16151.3%
 
d14611.2%
 
S13651.1%
 
g13601.1%
 
I13311.0%
 
M13301.0%
 
k11190.9%
 
m8660.7%
 
67570.6%
 
27040.6%
 
Other values (44)1330110.5%
 

Most occurring categories

Value                  Count  Frequency (%)
Lowercase Letter       89583  70.6%
Uppercase Letter       15609  12.3%
Connector Punctuation  11150  8.8%
Other Punctuation      6107   4.8%
Decimal Number         4249   3.4%
Dash Punctuation       59     < 0.1%
Open Punctuation       23     < 0.1%
Close Punctuation      23     < 0.1%

Most frequent Decimal Number characters

ValueCountFrequency (%) 
675717.8%
 
270416.6%
 
066715.7%
 
951012.0%
 
14009.4%
 
43137.4%
 
52866.7%
 
32556.0%
 
81834.3%
 
71744.1%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
,589196.5%
 
%1732.8%
 
.430.7%
 

Most frequent Connector Punctuation characters

ValueCountFrequency (%) 
_11150100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C469230.1%
 
S13658.7%
 
I13318.5%
 
M13308.5%
 
P5733.7%
 
N5683.6%
 
W5463.5%
 
T5363.4%
 
A5343.4%
 
B4813.1%
 
G4312.8%
 
L4272.7%
 
D3782.4%
 
K3592.3%
 
H3452.2%
 
O3182.0%
 
V3172.0%
 
R3112.0%
 
F2561.6%
 
J1601.0%
 
E1581.0%
 
Y960.6%
 
U680.4%
 
Z140.1%
 
Q140.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o1045711.7%
 
n994311.1%
 
a980710.9%
 
i79448.9%
 
t71238.0%
 
s62056.9%
 
e57456.4%
 
l57326.4%
 
u53696.0%
 
r50805.7%
 
y36444.1%
 
h28123.1%
 
c16151.8%
 
d14611.6%
 
g13601.5%
 
k11191.2%
 
m8661.0%
 
b6750.8%
 
w6510.7%
 
f5370.6%
 
p4900.5%
 
v4570.5%
 
x3500.4%
 
z960.1%
 
q29< 0.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(23100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)23100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-59100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin10519283.0%
 
Common2161117.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
_1115051.6%
 
,589127.3%
 
67573.5%
 
27043.3%
 
06673.1%
 
95102.4%
 
14001.9%
 
43131.4%
 
52861.3%
 
32551.2%
 
81830.8%
 
71740.8%
 
%1730.8%
 
-590.3%
 
.430.2%
 
(230.1%
 
)230.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o104579.9%
 
n99439.5%
 
a98079.3%
 
i79447.6%
 
t71236.8%
 
s62055.9%
 
e57455.5%
 
l57325.4%
 
u53695.1%
 
r50804.8%
 
C46924.5%
 
y36443.5%
 
h28122.7%
 
c16151.5%
 
d14611.4%
 
S13651.3%
 
g13601.3%
 
I13311.3%
 
M13301.3%
 
k11191.1%
 
m8660.8%
 
b6750.6%
 
w6510.6%
 
P5730.5%
 
N5680.5%
 
Other values (27)77257.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII126803100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
_111508.8%
 
o104578.2%
 
n99437.8%
 
a98077.7%
 
i79446.3%
 
t71235.6%
 
s62054.9%
 
,58914.6%
 
e57454.5%
 
l57324.5%
 
u53694.2%
 
r50804.0%
 
C46923.7%
 
y36442.9%
 
h28122.2%
 
c16151.3%
 
d14611.2%
 
S13651.1%
 
g13601.1%
 
I13311.0%
 
M13301.0%
 
k11190.9%
 
m8660.7%
 
67570.6%
 
27040.6%
 
Other values (44)1330110.5%
 

label
Categorical

HIGH CARDINALITY
UNIFORM

Distinct count: 5132
Unique (%): > 99.9%
Missing: 0
Missing (%): 0.0%
Memory size: 40.2 KiB

Value                                               Count  Frequency (%)
Republic of Macedonia                               2      < 0.1%
O'Higgins Region                                    2      < 0.1%
Franklin County, Pennsylvania                       1      < 0.1%
Ray County, Missouri                                1      < 0.1%
Johnson County, Arkansas                            1      < 0.1%
Eastham, Massachusetts                              1      < 0.1%
Alachua County, Florida                             1      < 0.1%
61455, Macomb, Illinois, Illinois                   1      < 0.1%
Chattooga County, Georgia                           1      < 0.1%
62232, Caseyville, Illinois, Illinois               1      < 0.1%
Charleston County, South Carolina                   1      < 0.1%
Andrews County, Texas                               1      < 0.1%
Barber County, Kansas                               1      < 0.1%
Hanover, Massachusetts                              1      < 0.1%
Brazos County, Texas                                1      < 0.1%
29472, Ridgeville, South Carolina, South Carolina   1      < 0.1%
Grayson County, Virginia                            1      < 0.1%
Lawrence County, Pennsylvania                       1      < 0.1%
Henderson County, Illinois                          1      < 0.1%
Rusk County, Texas                                  1      < 0.1%
Grand Isle County, Vermont                          1      < 0.1%
Niobrara County, Wyoming                            1      < 0.1%
Townsend, Massachusetts                             1      < 0.1%
29180, Winnsboro, South Carolina, South Carolina    1      < 0.1%
Venango County, Pennsylvania                        1      < 0.1%
Other values (5107)                                 5107   99.5%

Length

Max length: 61
Median length: 24
Mean length: 24.61492014
Min length: 3

Overview of Unicode Properties

Unique unicode characters: 100
Unique unicode categories: 8
Unique unicode scripts: 2
Unique unicode blocks: 4
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
111498.8%
 
o104578.3%
 
n99437.9%
 
a98067.8%
 
i79456.3%
 
t71165.6%
 
s62024.9%
 
,58914.7%
 
e57454.5%
 
l57354.5%
 
u53724.3%
 
r50784.0%
 
C46313.7%
 
y36442.9%
 
h28102.2%
 
c16181.3%
 
d14621.2%
 
S13651.1%
 
g13601.1%
 
I13311.1%
 
M13301.1%
 
k11190.9%
 
m8660.7%
 
67520.6%
 
26880.5%
 
Other values (75)1295810.3%
 

Most occurring categories

Value              Count  Frequency (%)
Lowercase Letter   89652  70.9%
Uppercase Letter   15425  12.2%
Space Separator    11149  8.8%
Other Punctuation  5945   4.7%
Decimal Number     4099   3.2%
Dash Punctuation   61     < 0.1%
Open Punctuation   21     < 0.1%
Close Punctuation  21     < 0.1%

Most frequent Decimal Number characters

ValueCountFrequency (%) 
675218.3%
 
268816.8%
 
065616.0%
 
949412.1%
 
13759.1%
 
43077.5%
 
52867.0%
 
31994.9%
 
81784.3%
 
71644.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
,589199.1%
 
.410.7%
 
'80.1%
 
%50.1%
 

Most frequent Space Separator characters

ValueCountFrequency (%) 
11149100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
C463130.0%
 
S13658.8%
 
I13318.6%
 
M13308.6%
 
P5733.7%
 
N5663.7%
 
W5463.5%
 
T5363.5%
 
A4683.0%
 
B4492.9%
 
G4312.8%
 
L4272.8%
 
D3722.4%
 
K3592.3%
 
H3452.2%
 
O3182.1%
 
V3172.1%
 
R3132.0%
 
F2521.6%
 
J1601.0%
 
E1400.9%
 
Y960.6%
 
U670.4%
 
Z140.1%
 
Q140.1%
 
Other values (4)5< 0.1%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o1045711.7%
 
n994311.1%
 
a980610.9%
 
i79458.9%
 
t71167.9%
 
s62026.9%
 
e57456.4%
 
l57356.4%
 
u53726.0%
 
r50785.7%
 
y36444.1%
 
h28103.1%
 
c16181.8%
 
d14621.6%
 
g13601.5%
 
k11191.2%
 
m8661.0%
 
b6770.8%
 
w6510.7%
 
f5390.6%
 
p4930.5%
 
v4570.5%
 
x3500.4%
 
z970.1%
 
q29< 0.1%
 
Other values (27)810.1%
 

Most frequent Open Punctuation characters

ValueCountFrequency (%) 
(21100.0%
 

Most frequent Close Punctuation characters

ValueCountFrequency (%) 
)21100.0%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-5996.7%
 
23.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin10507783.1%
 
Common2129616.9%
 

Most frequent Common characters

ValueCountFrequency (%) 
1114952.4%
 
,589127.7%
 
67523.5%
 
26883.2%
 
06563.1%
 
94942.3%
 
13751.8%
 
43071.4%
 
52861.3%
 
31990.9%
 
81780.8%
 
71640.8%
 
-590.3%
 
.410.2%
 
(210.1%
 
)210.1%
 
'8< 0.1%
 
%5< 0.1%
 
2< 0.1%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o1045710.0%
 
n99439.5%
 
a98069.3%
 
i79457.6%
 
t71166.8%
 
s62025.9%
 
e57455.5%
 
l57355.5%
 
u53725.1%
 
r50784.8%
 
C46314.4%
 
y36443.5%
 
h28102.7%
 
c16181.5%
 
d14621.4%
 
S13651.3%
 
g13601.3%
 
I13311.3%
 
M13301.3%
 
k11191.1%
 
m8660.8%
 
b6770.6%
 
w6510.6%
 
P5730.5%
 
N5660.5%
 
Other values (56)76757.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII12630299.9%
 
None55< 0.1%
 
Latin Ext Additional14< 0.1%
 
Punctuation2< 0.1%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
111498.8%
 
o104578.3%
 
n99437.9%
 
a98067.8%
 
i79456.3%
 
t71165.6%
 
s62024.9%
 
,58914.7%
 
e57454.5%
 
l57354.5%
 
u53724.3%
 
r50784.0%
 
C46313.7%
 
y36442.9%
 
h28102.2%
 
c16181.3%
 
d14621.2%
 
S13651.1%
 
g13601.1%
 
I13311.1%
 
M13301.1%
 
k11190.9%
 
m8660.7%
 
67520.6%
 
26880.5%
 
Other values (45)1288710.2%
 

Most frequent None characters

ValueCountFrequency (%) 
é712.7%
 
â59.1%
 
á59.1%
 
à59.1%
 
í47.3%
 
ê47.3%
 
ì35.5%
 
ô23.6%
 
ü23.6%
 
ơ23.6%
 
ñ23.6%
 
Đ23.6%
 
è23.6%
 
ĩ23.6%
 
ư23.6%
 
ö11.8%
 
Î11.8%
 
ò11.8%
 
Ñ11.8%
 
ó11.8%
 
ú11.8%
 

Most frequent Latin Ext Additional characters

ValueCountFrequency (%) 
321.4%
 
214.3%
 
ế214.3%
 
214.3%
 
214.3%
 
17.1%
 
17.1%
 
17.1%
 

Most frequent Punctuation characters

ValueCountFrequency (%) 
2100.0%
 

referenceDate
Categorical

CONSTANT
REJECTED

Distinct count: 1
Unique (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 40.2 KiB

Value       Count  Frequency (%)
2020-06-09  5134   100.0%

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Overview of Unicode Properties

Unique unicode characters: 5
Unique unicode categories: 2
Unique unicode scripts: 1
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
02053640.0%
 
21026820.0%
 
-1026820.0%
 
6513410.0%
 
9513410.0%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number4107280.0%
 
Dash Punctuation1026820.0%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
02053650.0%
 
21026825.0%
 
6513412.5%
 
9513412.5%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-10268100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Common51340100.0%
 

Most frequent Common characters

ValueCountFrequency (%) 
02053640.0%
 
21026820.0%
 
-1026820.0%
 
6513410.0%
 
9513410.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII51340100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
02053640.0%
 
21026820.0%
 
-1026820.0%
 
6513410.0%
 
9513410.0%
 

lastUpdatedDate
Categorical

HIGH CARDINALITY
MISSING

Distinct count: 77
Unique (%): 2.3%
Missing: 1809
Missing (%): 35.2%
Memory size: 40.2 KiB

Value                     Count  Frequency (%)
2020-06-09T05:00:00.000Z  462    9.0%
2020-06-08T20:00:00.000Z  423    8.2%
2020-06-09T04:00:00.000Z  357    7.0%
2020-06-08T16:00:00.000Z  213    4.1%
2020-06-08T23:00:00.000Z  207    4.0%
2020-06-08T19:00:04.000Z  160    3.1%
2020-06-09T13:00:00.000Z  144    2.8%
2020-06-08T00:00:00.000Z  132    2.6%
2020-06-09T11:41:20.000Z  114    2.2%
2020-06-08T17:00:00.000Z  107    2.1%
2020-06-08T18:00:00.000Z  106    2.1%
2020-06-08T19:00:00.000Z  104    2.0%
2020-06-08T03:00:00.000Z  88     1.7%
2020-06-09T07:35:00.000Z  82     1.6%
2020-06-09T16:00:00.000Z  80     1.6%
2020-06-09T06:00:00.000Z  73     1.4%
2020-06-08T07:00:00.000Z  40     0.8%
2020-06-08T14:00:00.000Z  38     0.7%
2020-06-08T19:01:00.000Z  37     0.7%
2020-06-09T08:00:00.000Z  34     0.7%
2020-06-09T00:00:00.000Z  33     0.6%
2020-06-08T22:00:00.000Z  32     0.6%
2020-06-09T15:00:00.000Z  29     0.6%
2020-06-09T14:00:00.000Z  25     0.5%
2020-06-08T21:00:00.000Z  24     0.5%
Other values (52)         181    3.5%
(Missing)                 1809   35.2%

Length

Max length: 24
Median length: 24
Mean length: 16.55414881
Min length: 3
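A minimum length of 3 and a mean well below 24 look odd for ISO timestamps. One reading that is consistent with the character counts reported for this variable (1809 occurrences of "a", 3618 of "n", 3308 of "T") is that missing values were measured as the string "nan" and that a few values are 10-character dates without a time part. The sketch below checks that reading against the reported mean; both points are inferences from the tables, not something the report states.

```python
# Counts taken from the lastUpdatedDate tables in this report.
n_rows = 5134
n_missing = 1809
n_with_time = 3308  # occurrences of "T", one per full 24-character timestamp

# Assumption: missing values are measured as the 3-character string "nan".
# Assumption: the remaining non-missing values are 10-character dates.
n_date_only = n_rows - n_missing - n_with_time

total_chars = n_with_time * 24 + n_date_only * 10 + n_missing * 3
mean_length = total_chars / n_rows

print(n_date_only, total_chars, round(mean_length, 8))
```

Under these assumptions the total is 84989 characters, which matches the ASCII character count reported for this variable, and the mean agrees with the reported mean length.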

Overview of Unicode Properties

Unique unicode characters: 17
Unique unicode categories: 5
Unique unicode scripts: 2
Unique unicode blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
03763044.3%
 
275258.9%
 
-66507.8%
 
:66167.8%
 
637164.4%
 
n36184.3%
 
T33083.9%
 
.33083.9%
 
Z33083.9%
 
819092.2%
 
918812.2%
 
a18092.1%
 
115711.8%
 
47260.9%
 
56010.7%
 
35460.6%
 
72670.3%
 

Most occurring categories

ValueCountFrequency (%) 
Decimal Number5637266.3%
 
Other Punctuation992411.7%
 
Dash Punctuation66507.8%
 
Uppercase Letter66167.8%
 
Lowercase Letter54276.4%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
03763066.8%
 
2752513.3%
 
637166.6%
 
819093.4%
 
918813.3%
 
115712.8%
 
47261.3%
 
56011.1%
 
35461.0%
 
72670.5%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-6650100.0%
 

Most frequent Uppercase Letter characters

ValueCountFrequency (%) 
T330850.0%
 
Z330850.0%
 

Most frequent Other Punctuation characters

ValueCountFrequency (%) 
:661666.7%
 
.330833.3%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
n361866.7%
 
a180933.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Common7294685.8%
 
Latin1204314.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
03763051.6%
 
2752510.3%
 
-66509.1%
 
:66169.1%
 
637165.1%
 
.33084.5%
 
819092.6%
 
918812.6%
 
115712.2%
 
47261.0%
 
56010.8%
 
35460.7%
 
72670.4%
 

Most frequent Latin characters

ValueCountFrequency (%) 
n361830.0%
 
T330827.5%
 
Z330827.5%
 
a180915.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII84989100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
03763044.3%
 
275258.9%
 
-66507.8%
 
:66167.8%
 
637164.4%
 
n36184.3%
 
T33083.9%
 
.33083.9%
 
Z33083.9%
 
819092.2%
 
918812.2%
 
a18092.1%
 
115711.8%
 
47260.9%
 
56010.7%
 
35460.6%
 
72670.3%
 

totalDeaths
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count: 388
Unique (%): 12.0%
Missing: 1899
Missing (%): 37.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 353.65935085007726
Minimum: 0.0
Maximum: 407403.0
Zeros: 989
Zeros (%): 19.3%
Memory size: 40.2 KiB
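The percentages in this summary follow from the counts. In the sketch below, Missing (%) and Zeros (%) are taken over all 5134 rows, while Unique (%) is assumed to be taken over non-missing rows only; that assumption is inferred from the numbers and is not stated in the report.

```python
# Counts taken from the totalDeaths summary in this report.
n_rows = 5134
missing = 1899
zeros = 989
distinct = 388

non_missing = n_rows - missing
missing_pct = 100 * missing / n_rows
zeros_pct = 100 * zeros / n_rows
# Assumption: "Unique (%)" divides distinct values by non-missing rows.
unique_pct = 100 * distinct / non_missing

print(f"Missing: {missing_pct:.1f}%  Zeros: {zeros_pct:.1f}%  Unique: {unique_pct:.1f}%")
```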

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 2
Q3: 19
95-th percentile: 426.3
Maximum: 407403
Range: 407403
Interquartile range (IQR): 19

Descriptive statistics

Standard deviation: 7598.978075
Coefficient of variation (CV): 21.48671612
Kurtosis: 2562.233375
Mean: 353.6593509
Median Absolute Deviation (MAD): 2
Skewness: 48.68424864
Sum: 1144088
Variance: 57744467.78
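Several of the descriptive statistics are simple functions of the others. The sketch below recomputes them for totalDeaths from the values printed above, using the standard definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum). It assumes the sum runs over the 3235 non-missing rows.

```python
# Values taken from the totalDeaths statistics in this report.
mean = 353.6593509
std = 7598.978075
q1, q3 = 0, 19
minimum, maximum = 0, 407403
non_missing = 5134 - 1899

cv = std / mean                  # coefficient of variation
variance = std ** 2
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum
approx_sum = mean * non_missing  # assumption: sum over non-missing rows

print(round(cv, 4), round(variance, 2), iqr, value_range, round(approx_sum))
```

A CV above 21 means the standard deviation is more than twenty times the mean, which fits the heavy right tail indicated by the skewness.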
Histogram with fixed size bins (bins=10)
Value               Count  Frequency (%)
0                   989    19.3%
1                   413    8.0%
2                   226    4.4%
3                   137    2.7%
4                   101    2.0%
5                   85     1.7%
6                   63     1.2%
9                   49     1.0%
8                   48     0.9%
7                   45     0.9%
11                  42     0.8%
10                  36     0.7%
12                  34     0.7%
13                  32     0.6%
17                  29     0.6%
18                  28     0.5%
14                  24     0.5%
15                  24     0.5%
16                  20     0.4%
22                  19     0.4%
20                  18     0.4%
31                  18     0.4%
27                  18     0.4%
23                  17     0.3%
26                  17     0.3%
Other values (363)  703    13.7%
(Missing)           1899   37.0%

Value  Count  Frequency (%)
0      989    19.3%
1      413    8.0%
2      226    4.4%
3      137    2.7%
4      101    2.0%
5      85     1.7%
6      63     1.2%
7      45     0.9%
8      48     0.9%
9      49     1.0%

Value   Count  Frequency (%)
407403  1      < 0.1%
111007  1      < 0.1%
40883   1      < 0.1%
37961   1      < 0.1%
33964   1      < 0.1%
30239   1      < 0.1%
29209   1      < 0.1%
27136   1      < 0.1%
16302   1      < 0.1%
15937   1      < 0.1%

totalConfirmedCases
Real number (ℝ≥0)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count: 1331
Unique (%): 26.0%
Missing: 24
Missing (%): 0.5%
Infinite: 0
Infinite (%): 0.0%
Mean: 3980.7665362035227
Minimum: 0.0
Maximum: 7107780.0
Zeros: 109
Zeros (%): 2.1%
Memory size: 40.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 2
Q1: 16
median: 69.5
Q3: 323.75
95-th percentile: 4737.2
Maximum: 7107780
Range: 7107780
Interquartile range (IQR): 307.75

Descriptive statistics

Standard deviation: 104712.3784
Coefficient of variation (CV): 26.30457663
Kurtosis: 4172.440915
Mean: 3980.766536
Median Absolute Deviation (MAD): 64.5
Skewness: 62.5309263
Sum: 20341717
Variance: 1.096468219e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11372.7%
 
31322.6%
 
01092.1%
 
5971.9%
 
2961.9%
 
8831.6%
 
6801.6%
 
7801.6%
 
4791.5%
 
13641.2%
 
9621.2%
 
12601.2%
 
19521.0%
 
10521.0%
 
11501.0%
 
14460.9%
 
15460.9%
 
21420.8%
 
18410.8%
 
16400.8%
 
20390.8%
 
17390.8%
 
22360.7%
 
23350.7%
 
39330.6%
 
Other values (1306)348067.8%
 
ValueCountFrequency (%) 
01092.1%
 
11372.7%
 
2961.9%
 
31322.6%
 
4791.5%
 
5971.9%
 
6801.6%
 
7801.6%
 
8831.6%
 
9621.2%
 
ValueCountFrequency (%) 
71077801< 0.1%
 
19611851< 0.1%
 
7230661< 0.1%
 
4852531< 0.1%
 
3787991< 0.1%
 
2891401< 0.1%
 
2665981< 0.1%
 
2417171< 0.1%
 
2352781< 0.1%
 
2073531< 0.1%
 

totalRecoveredCases
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct count: 462
Unique (%): 42.2%
Missing: 4040
Missing (%): 78.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 2733.0310786106033
Minimum: 0.0
Maximum: 296128.0
Zeros: 53
Zeros (%): 1.0%
Memory size: 40.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 6
median: 29
Q3: 423.25
95-th percentile: 7992.3
Maximum: 296128
Range: 296128
Interquartile range (IQR): 417.25

Descriptive statistics

Standard deviation: 16500.23002
Coefficient of variation (CV): 6.037337135
Kurtosis: 160.4276894
Mean: 2733.031079
Median Absolute Deviation (MAD): 28
Skewness: 11.58418918
Sum: 2989936
Variance: 272257590.8
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1701.4%
 
0531.0%
 
2390.8%
 
3380.7%
 
4320.6%
 
5300.6%
 
8250.5%
 
13190.4%
 
7190.4%
 
10160.3%
 
6160.3%
 
16150.3%
 
15150.3%
 
11150.3%
 
17140.3%
 
20120.2%
 
9110.2%
 
12110.2%
 
18110.2%
 
14100.2%
 
23100.2%
 
2290.2%
 
2190.2%
 
2790.2%
 
1990.2%
 
Other values (437)57711.2%
 
(Missing)404078.7%
 
ValueCountFrequency (%) 
0531.0%
 
1701.4%
 
2390.8%
 
3380.7%
 
4320.6%
 
5300.6%
 
6160.3%
 
7190.4%
 
8250.5%
 
9110.2%
 
ValueCountFrequency (%) 
2961281< 0.1%
 
2423971< 0.1%
 
1702001< 0.1%
 
1665841< 0.1%
 
1413801< 0.1%
 
1293141< 0.1%
 
1097371< 0.1%
 
895561< 0.1%
 
882171< 0.1%
 
783511< 0.1%
 

totalTestedCases
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
SKEWED

Distinct count: 1484
Unique (%): 81.0%
Missing: 3301
Missing (%): 64.3%
Infinite: 0
Infinite (%): 0.0%
Mean: 34390.09056192035
Minimum: 0.0
Maximum: 13200000.0
Zeros: 14
Zeros (%): 0.3%
Memory size: 40.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 39
Q1: 428
median: 1204
Q3: 3356
95-th percentile: 91734.4
Maximum: 13200000
Range: 13200000
Interquartile range (IQR): 2928

Descriptive statistics

Standard deviation: 343310.0899
Coefficient of variation (CV): 9.982820175
Kurtosis: 1187.29101
Mean: 34390.09056
Median Absolute Deviation (MAD): 983
Skewness: 31.76561006
Sum: 63037036
Variance: 1.178618178e+11
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0140.3%
 
570.1%
 
760.1%
 
460.1%
 
960.1%
 
6340.1%
 
1840.1%
 
70840.1%
 
53940.1%
 
140.1%
 
15040.1%
 
16640.1%
 
46440.1%
 
34440.1%
 
43540.1%
 
52030.1%
 
108330.1%
 
25930.1%
 
4730.1%
 
72730.1%
 
33930.1%
 
126630.1%
 
28430.1%
 
91330.1%
 
57830.1%
 
Other values (1459)172433.6%
 
(Missing)330164.3%
 
ValueCountFrequency (%) 
0140.3%
 
140.1%
 
22< 0.1%
 
330.1%
 
460.1%
 
570.1%
 
760.1%
 
82< 0.1%
 
960.1%
 
102< 0.1%
 
ValueCountFrequency (%) 
132000001< 0.1%
 
26434891< 0.1%
 
25558961< 0.1%
 
23779541< 0.1%
 
19301411< 0.1%
 
16506841< 0.1%
 
12861391< 0.1%
 
12355131< 0.1%
 
12039851< 0.1%
 
11537071< 0.1%
 

numPositiveTests
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count: 145
Unique (%): 9.1%
Missing: 3535
Missing (%): 68.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 97.32270168855534
Minimum: -6.0
Maximum: 75908.0
Zeros: 1296
Zeros (%): 25.2%
Memory size: 40.2 KiB

Quantile statistics

Minimum: -6
5-th percentile: 0
Q1: 0
median: 0
Q3: 0
95-th percentile: 96.1
Maximum: 75908
Range: 75914
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1973.775048
Coefficient of variation (CV): 20.28072601
Kurtosis: 1367.103292
Mean: 97.32270169
Median Absolute Deviation (MAD): 0
Skewness: 35.95050739
Sum: 155619
Variance: 3895787.94
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0129625.2%
 
1541.1%
 
2130.3%
 
3120.2%
 
480.2%
 
760.1%
 
7550.1%
 
1050.1%
 
850.1%
 
1640.1%
 
940.1%
 
2630.1%
 
1330.1%
 
4530.1%
 
4630.1%
 
14330.1%
 
6530.1%
 
8330.1%
 
9730.1%
 
2430.1%
 
522< 0.1%
 
482< 0.1%
 
342< 0.1%
 
382< 0.1%
 
862< 0.1%
 
Other values (120)1502.9%
 
(Missing)353568.9%
 
ValueCountFrequency (%) 
-61< 0.1%
 
-42< 0.1%
 
-21< 0.1%
 
-11< 0.1%
 
0129625.2%
 
1541.1%
 
2130.3%
 
3120.2%
 
480.2%
 
52< 0.1%
 
ValueCountFrequency (%) 
759081< 0.1%
 
156541< 0.1%
 
99871< 0.1%
 
85951< 0.1%
 
46461< 0.1%
 
25941< 0.1%
 
25531< 0.1%
 
17211< 0.1%
 
15721< 0.1%
 
15621< 0.1%
 

numDeaths
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count: 49
Unique (%): 8.4%
Missing: 4551
Missing (%): 88.6%
Infinite: 0
Infinite (%): 0.0%
Mean: 12.39451114922813
Minimum: 0.0
Maximum: 3076.0
Zeros: 419
Zeros (%): 8.2%
Memory size: 40.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 0
Q3: 1
95-th percentile: 31.9
Maximum: 3076
Range: 3076
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 133.9566611
Coefficient of variation (CV): 10.80774058
Kurtosis: 473.3996079
Mean: 12.39451115
Median Absolute Deviation (MAD): 0
Skewness: 20.97331424
Sum: 7226
Variance: 17944.38705
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
04198.2%
 
1400.8%
 
3190.4%
 
2170.3%
 
4100.2%
 
590.2%
 
950.1%
 
750.1%
 
840.1%
 
640.1%
 
1730.1%
 
1230.1%
 
112< 0.1%
 
312< 0.1%
 
212< 0.1%
 
342< 0.1%
 
192< 0.1%
 
552< 0.1%
 
742< 0.1%
 
352< 0.1%
 
131< 0.1%
 
141< 0.1%
 
101< 0.1%
 
151< 0.1%
 
321< 0.1%
 
Other values (24)240.5%
 
(Missing)455188.6%
 
ValueCountFrequency (%) 
04198.2%
 
1400.8%
 
2170.3%
 
3190.4%
 
4100.2%
 
590.2%
 
640.1%
 
750.1%
 
840.1%
 
950.1%
 
ValueCountFrequency (%) 
30761< 0.1%
 
6791< 0.1%
 
4931< 0.1%
 
3541< 0.1%
 
3311< 0.1%
 
1711< 0.1%
 
1091< 0.1%
 
1061< 0.1%
 
1051< 0.1%
 
821< 0.1%
 

numRecoveredCases
Real number (ℝ≥0)

HIGH CORRELATION
MISSING
ZEROS

Distinct count: 105
Unique (%): 41.5%
Missing: 4881
Missing (%): 95.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 176.68774703557312
Minimum: 0.0
Maximum: 11709.0
Zeros: 86
Zeros (%): 1.7%
Memory size: 40.2 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
median: 10
Q3: 68
95-th percentile: 435.8
Maximum: 11709
Range: 11709
Interquartile range (IQR): 68

Descriptive statistics

Standard deviation: 948.5029314
Coefficient of variation (CV): 5.368243963
Kurtosis: 98.10383865
Mean: 176.687747
Median Absolute Deviation (MAD): 10
Skewness: 9.325382838
Sum: 44702
Variance: 899657.8108
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0861.7%
 
190.2%
 
370.1%
 
270.1%
 
1750.1%
 
740.1%
 
930.1%
 
9930.1%
 
6930.1%
 
430.1%
 
630.1%
 
1030.1%
 
4030.1%
 
1230.1%
 
2530.1%
 
8030.1%
 
3930.1%
 
1430.1%
 
192< 0.1%
 
162< 0.1%
 
682< 0.1%
 
362< 0.1%
 
652< 0.1%
 
152< 0.1%
 
82< 0.1%
 
Other values (80)851.7%
 
(Missing)488195.1%
 
ValueCountFrequency (%) 
0861.7%
 
190.2%
 
270.1%
 
370.1%
 
430.1%
 
51< 0.1%
 
630.1%
 
740.1%
 
82< 0.1%
 
930.1%
 
ValueCountFrequency (%) 
117091< 0.1%
 
60881< 0.1%
 
53901< 0.1%
 
48841< 0.1%
 
16611< 0.1%
 
8181< 0.1%
 
6631< 0.1%
 
6411< 0.1%
 
6261< 0.1%
 
6001< 0.1%
 

diffNumPositiveTests
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count: 111
Unique (%): 6.9%
Missing: 3514
Missing (%): 68.4%
Infinite: 0
Infinite (%): 0.0%
Mean: -23.298148148148147
Minimum: -25890.0
Maximum: 353.0
Zeros: 855
Zeros (%): 16.7%
Memory size: 40.2 KiB

Quantile statistics

Minimum: -25890
5-th percentile: -13
Q1: -1
median: 0
Q3: 0
95-th percentile: 2.05
Maximum: 353
Range: 26243
Interquartile range (IQR): 1

Descriptive statistics

Standard deviation: 655.1291821
Coefficient of variation (CV): -28.11936717
Kurtosis: 1505.274672
Mean: -23.29814815
Median Absolute Deviation (MAD): 0
Skewness: -38.29243544
Sum: -37743
Variance: 429194.2452
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
085516.7%
 
-12003.9%
 
-21062.1%
 
1651.3%
 
-3631.2%
 
-4460.9%
 
-5260.5%
 
-6180.4%
 
2170.3%
 
-7170.3%
 
-8150.3%
 
3150.3%
 
-9110.2%
 
480.2%
 
-2270.1%
 
-1070.1%
 
-1260.1%
 
-1150.1%
 
2850.1%
 
740.1%
 
640.1%
 
2040.1%
 
-1340.1%
 
2140.1%
 
-1530.1%
 
Other values (86)1052.0%
 
(Missing)351468.4%
 
ValueCountFrequency (%) 
-258901< 0.1%
 
-46961< 0.1%
 
-7891< 0.1%
 
-5511< 0.1%
 
-5451< 0.1%
 
-5051< 0.1%
 
-4541< 0.1%
 
-4291< 0.1%
 
-4031< 0.1%
 
-3921< 0.1%
 
ValueCountFrequency (%) 
3531< 0.1%
 
2291< 0.1%
 
1961< 0.1%
 
1941< 0.1%
 
1821< 0.1%
 
1551< 0.1%
 
1361< 0.1%
 
1211< 0.1%
 
851< 0.1%
 
6930.1%
 

diffNumDeaths
Real number (ℝ)

HIGH CORRELATION
MISSING
ZEROS

Distinct count: 41
Unique (%): 7.0%
Missing: 4549
Missing (%): 88.6%
Infinite: 0
Infinite (%): 0.0%
Mean: -1.9162393162393163
Minimum: -807.0
Maximum: 166.0
Zeros: 425
Zeros (%): 8.3%
Memory size: 40.2 KiB

Quantile statistics

Minimum: -807
5-th percentile: -2
Q1: 0
median: 0
Q3: 0
95-th percentile: 3
Maximum: 166
Range: 973
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 43.18780459
Coefficient of variation (CV): -22.53779276
Kurtosis: 261.1873301
Mean: -1.916239316
Median Absolute Deviation (MAD): 0
Skewness: -15.20969692
Sum: -1121
Variance: 1865.186465
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
04258.3%
 
1460.9%
 
-1270.5%
 
-2140.3%
 
2120.2%
 
-470.1%
 
350.1%
 
-340.1%
 
440.1%
 
530.1%
 
830.1%
 
-182< 0.1%
 
112< 0.1%
 
-72< 0.1%
 
72< 0.1%
 
-52< 0.1%
 
401< 0.1%
 
-101< 0.1%
 
1251< 0.1%
 
-581< 0.1%
 
-171< 0.1%
 
311< 0.1%
 
-2191< 0.1%
 
131< 0.1%
 
-8071< 0.1%
 
Other values (16)160.3%
 
(Missing)454988.6%
 
ValueCountFrequency (%) 
-8071< 0.1%
 
-5751< 0.1%
 
-2191< 0.1%
 
-581< 0.1%
 
-241< 0.1%
 
-182< 0.1%
 
-171< 0.1%
 
-131< 0.1%
 
-111< 0.1%
 
-101< 0.1%
 
ValueCountFrequency (%) 
1661< 0.1%
 
1251< 0.1%
 
591< 0.1%
 
561< 0.1%
 
401< 0.1%
 
321< 0.1%
 
311< 0.1%
 
261< 0.1%
 
181< 0.1%
 
141< 0.1%
 

avgWeeklyDeaths
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count: 143
Unique (%): 4.5%
Missing: 1948
Missing (%): 37.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.238848085373509
Minimum: -193.71
Maximum: 3985.57
Zeros: 2121
Zeros (%): 41.3%
Memory size: 40.2 KiB

Quantile statistics

Minimum: -193.71
5-th percentile: 0
Q1: 0
median: 0
Q3: 0.14
95-th percentile: 3.71
Maximum: 3985.57
Range: 4179.28
Interquartile range (IQR): 0.14

Descriptive statistics

Standard deviation: 74.86834869
Coefficient of variation (CV): 23.11573335
Kurtosis: 2526.273946
Mean: 3.238848085
Median Absolute Deviation (MAD): 0
Skewness: 48.28546627
Sum: 10318.97
Variance: 5605.269636
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
0212141.3%
 
0.143386.6%
 
0.291442.8%
 
0.43911.8%
 
0.57691.3%
 
0.71340.7%
 
0.86210.4%
 
1.29190.4%
 
1190.4%
 
1.43190.4%
 
-0.14180.4%
 
1.14170.3%
 
1.86150.3%
 
2.29140.3%
 
1.57120.2%
 
1.71120.2%
 
290.2%
 
2.1480.2%
 
5.4370.1%
 
3.5760.1%
 
2.7150.1%
 
5.2950.1%
 
2.4350.1%
 
3.1450.1%
 
7.4340.1%
 
Other values (118)1693.3%
 
(Missing)194837.9%
 
ValueCountFrequency (%) 
-193.711< 0.1%
 
-73.861< 0.1%
 
-2.141< 0.1%
 
-0.711< 0.1%
 
-0.571< 0.1%
 
-0.291< 0.1%
 
-0.14180.4%
 
0212141.3%
 
0.131< 0.1%
 
0.143386.6%
 
ValueCountFrequency (%) 
3985.571< 0.1%
 
9661< 0.1%
 
667.571< 0.1%
 
4881< 0.1%
 
267.571< 0.1%
 
216.291< 0.1%
 
176.571< 0.1%
 
160.861< 0.1%
 
157.861< 0.1%
 
153.711< 0.1%
 

avgWeeklyConfirmedCases
Real number (ℝ)

HIGH CORRELATION
SKEWED
ZEROS

Distinct count: 565
Unique (%): 11.1%
Missing: 48
Missing (%): 0.9%
Infinite: 0
Infinite (%): 0.0%
Mean: 58.38501179709005
Minimum: -1557.0
Maximum: 113467.29
Zeros: 1169
Zeros (%): 22.8%
Memory size: 40.2 KiB

Quantile statistics

Minimum: -1557
5-th percentile: 0
Q1: 0.14
median: 0.57
Q3: 3
95-th percentile: 64.2875
Maximum: 113467.29
Range: 115024.29
Interquartile range (IQR): 2.86

Descriptive statistics

Standard deviation: 1667.523248
Coefficient of variation (CV): 28.56081033
Kurtosis: 4221.526498
Mean: 58.3850118
Median Absolute Deviation (MAD): 0.57
Skewness: 62.85280488
Sum: 296946.17
Variance: 2780633.784
Histogram with fixed size bins (bins=10)
Value                 Count    Frequency (%)
0                     1169     22.8%
0.14                  511      10.0%
0.29                  335      6.5%
0.43                  244      4.8%
0.57                  203      4.0%
0.71                  179      3.5%
0.86                  146      2.8%
1                     132      2.6%
1.43                  92       1.8%
1.29                  91       1.8%
1.14                  83       1.6%
1.71                  82       1.6%
1.57                  77       1.5%
2.29                  67       1.3%
1.86                  56       1.1%
2                     53       1.0%
-0.14                 49       1.0%
2.14                  49       1.0%
2.71                  40       0.8%
3                     31       0.6%
3.29                  30       0.6%
3.14                  30       0.6%
3.57                  30       0.6%
2.57                  28       0.5%
3.43                  28       0.5%
Other values (540)    1251     24.4%
(Missing)             48       0.9%

Value     Count    Frequency (%)
-1557     1        < 0.1%
-5.86     1        < 0.1%
-1.71     1        < 0.1%
-0.86     1        < 0.1%
-0.43     1        < 0.1%
-0.29     10       0.2%
-0.14     49       1.0%
0         1169     22.8%
0.01      3        0.1%
0.04      1        < 0.1%

Value        Count    Frequency (%)
113467.29    1        < 0.1%
23954.71     1        < 0.1%
19298.71     1        < 0.1%
9698.86      1        < 0.1%
8787.43      1        < 0.1%
4559.86      1        < 0.1%
4308.57      1        < 0.1%
3544.57      1        < 0.1%
3420.29      1        < 0.1%
3253.71      1        < 0.1%
 

avgWeeklyRecoveredCases
Real number (ℝ)

HIGH CORRELATION
MISSING
SKEWED
ZEROS

Distinct count    264
Unique (%)        24.5%
Missing           4057
Missing (%)       79.0%
Infinite          0
Infinite (%)      0.0%
Mean              147.45445682451253
Minimum           -4.29
Maximum           86942.71
Zeros             306
Zeros (%)         6.0%
Memory size       40.2 KiB

Quantile statistics

Minimum                      -4.29
5-th percentile              0
Q1                           0
median                       0.43
Q3                           6
95-th percentile             145.026
Maximum                      86942.71
Range                        86947
Interquartile range (IQR)    6

Descriptive statistics

Standard deviation                 2698.228542
Coefficient of variation (CV)      18.29872491
Kurtosis                           997.9610713
Mean                               147.4544568
Median Absolute Deviation (MAD)    0.43
Skewness                           31.09397052
Sum                                158808.45
Variance                           7280437.263
Histogram with fixed size bins (bins=10)
Value                 Count    Frequency (%)
0                     306      6.0%
0.14                  137      2.7%
0.29                  66       1.3%
0.43                  41       0.8%
0.57                  33       0.6%
0.86                  25       0.5%
0.71                  23       0.4%
1                     20       0.4%
1.14                  18       0.4%
1.29                  13       0.3%
1.86                  12       0.2%
2.14                  11       0.2%
2.43                  9        0.2%
1.57                  9        0.2%
2                     8        0.2%
3.86                  7        0.1%
1.43                  5        0.1%
1.71                  5        0.1%
9.43                  5        0.1%
4.29                  5        0.1%
3.57                  5        0.1%
4.43                  4        0.1%
2.57                  4        0.1%
17.43                 4        0.1%
7.57                  4        0.1%
Other values (239)    298      5.8%
(Missing)             4057     79.0%

Value    Count    Frequency (%)
-4.29    1        < 0.1%
-3       1        < 0.1%
-0.14    3        0.1%
0        306      6.0%
0.14     137      2.7%
0.29     66       1.3%
0.43     41       0.8%
0.57     33       0.6%
0.71     23       0.4%
0.86     25       0.5%

Value       Count    Frequency (%)
86942.71    1        < 0.1%
10355.71    1        < 0.1%
7916        1        < 0.1%
7807.71     1        < 0.1%
4826.71     1        < 0.1%
3066.86     1        < 0.1%
2899.86     1        < 0.1%
2558.43     1        < 0.1%
1647.57     1        < 0.1%
1637        1        < 0.1%

dataSource
Categorical

HIGH CARDINALITY

Distinct count    158
Unique (%)        3.1%
Missing           0
Missing (%)       0.0%
Memory size       40.2 KiB

http://www.dph.illinois.gov/     566
https://www.scdhec.gov/          403
https://www.mass.gov/            354
https://www.dshs.state.tx.us/    236
https://covid19.min-saude.pt/    168
Other values (153)               3407
Value                                        Count    Frequency (%)
http://www.dph.illinois.gov/                 566      11.0%
https://www.scdhec.gov/                      403      7.8%
https://www.mass.gov/                        354      6.9%
https://www.dshs.state.tx.us/                236      4.6%
https://covid19.min-saude.pt/                168      3.3%
https://dph.georgia.gov/                     160      3.1%
http://www.vdh.virginia.gov/                 134      2.6%
https://govstatus.egov.com/                  120      2.3%
https://www.ecdc.europa.eu/                  114      2.2%
https://www.ncdhhs.gov/                      101      2.0%
https://health.mo.gov/                       101      2.0%
https://www.tn.gov/                          96       1.9%
http://dhhs.ne.gov/                          94       1.8%
https://coronavirus.in.gov/                  92       1.8%
https://coronavirus.ohio.gov/                89       1.7%
https://www.coronavirus.kdheks.gov           89       1.7%
https://www.health.state.mn.us/              88       1.7%
http://publichealth.lacounty.gov/            88       1.7%
https://www.michigan.gov/                    84       1.6%
https://msdh.ms.gov/                         83       1.6%
https://xn--80aesfpebagmfblc0a.xn--p1ai/#    82       1.6%
https://www.healthy.arkansas.gov/            76       1.5%
https://www.dhs.wisconsin.gov/               73       1.4%
https://info.gesundheitsministerium.at/      71       1.4%
https://www.health.pa.gov/                   68       1.3%
Other values (133)                           1504     29.3%

Length

Max length       66
Median length    27
Mean length      26.30989482
Min length       16

Overview of Unicode Properties

Unique unicode characters    46
Unique unicode categories    5
Unique unicode scripts       2
Unique unicode blocks        1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

Value                Count    Frequency (%)
/                    15226    11.3%
t                    13047    9.7%
.                    12237    9.1%
h                    9833     7.3%
s                    9306     6.9%
w                    9263     6.9%
o                    8172     6.0%
p                    6687     5.0%
v                    5651     4.2%
:                    5134     3.8%
g                    5048     3.7%
i                    5008     3.7%
a                    4726     3.5%
d                    3595     2.7%
c                    3181     2.4%
n                    3150     2.3%
e                    3118     2.3%
l                    2673     2.0%
r                    1987     1.5%
u                    1849     1.4%
m                    1769     1.3%
-                    620      0.5%
1                    542      0.4%
9                    460      0.3%
f                    416      0.3%
Other values (21)    2377     1.8%

Most occurring categories

Value                Count     Frequency (%)
Lowercase Letter     100069    74.1%
Other Punctuation    32679     24.2%
Decimal Number       1326      1.0%
Dash Punctuation     620       0.5%
Uppercase Letter     381       0.3%

Most frequent Lowercase Letter characters

Value    Count    Frequency (%)
t        13047    13.0%
h        9833     9.8%
s        9306     9.3%
w        9263     9.3%
o        8172     8.2%
p        6687     6.7%
v        5651     5.6%
g        5048     5.0%
i        5008     5.0%
a        4726     4.7%
d        3595     3.6%
c        3181     3.2%
n        3150     3.1%
e        3118     3.1%
l        2673     2.7%
r        1987     2.0%
u        1849     1.8%
m        1769     1.8%
f        416      0.4%
b        405      0.4%
x        401      0.4%
k        396      0.4%
y        298      0.3%
z        51       0.1%
j        24       < 0.1%

Most frequent Other Punctuation characters

Value    Count    Frequency (%)
/        15226    46.6%
.        12237    37.4%
:        5134     15.7%
#        82       0.3%

Most frequent Decimal Number characters

Value    Count    Frequency (%)
1        542      40.9%
9        460      34.7%
0        201      15.2%
8        83       6.3%
2        38       2.9%
6        2        0.2%

Most frequent Dash Punctuation characters

Value    Count    Frequency (%)
-        620      100.0%

Most frequent Uppercase Letter characters

Value    Count    Frequency (%)
O        113      29.7%
C        40       10.5%
V        39       10.2%
I        38       10.0%
D        38       10.0%
H        38       10.0%
R        37       9.7%
A        37       9.7%
M        1        0.3%

Most occurring scripts

Value     Count     Frequency (%)
Latin     100450    74.4%
Common    34625     25.6%

Most frequent Latin characters

Value                Count    Frequency (%)
t                    13047    13.0%
h                    9833     9.8%
s                    9306     9.3%
w                    9263     9.2%
o                    8172     8.1%
p                    6687     6.7%
v                    5651     5.6%
g                    5048     5.0%
i                    5008     5.0%
a                    4726     4.7%
d                    3595     3.6%
c                    3181     3.2%
n                    3150     3.1%
e                    3118     3.1%
l                    2673     2.7%
r                    1987     2.0%
u                    1849     1.8%
m                    1769     1.8%
f                    416      0.4%
b                    405      0.4%
x                    401      0.4%
k                    396      0.4%
y                    298      0.3%
O                    113      0.1%
z                    51       0.1%
Other values (10)    307      0.3%

Most frequent Common characters

Value    Count    Frequency (%)
/        15226    44.0%
.        12237    35.3%
:        5134     14.8%
-        620      1.8%
1        542      1.6%
9        460      1.3%
0        201      0.6%
8        83       0.2%
#        82       0.2%
2        38       0.1%
6        2        < 0.1%

Most occurring blocks

Value    Count     Frequency (%)
ASCII    135075    100.0%

Most frequent ASCII characters

Value                Count    Frequency (%)
/                    15226    11.3%
t                    13047    9.7%
.                    12237    9.1%
h                    9833     7.3%
s                    9306     6.9%
w                    9263     6.9%
o                    8172     6.0%
p                    6687     5.0%
v                    5651     4.2%
:                    5134     3.8%
g                    5048     3.7%
i                    5008     3.7%
a                    4726     3.5%
d                    3595     2.7%
c                    3181     2.4%
n                    3150     2.3%
e                    3118     2.3%
l                    2673     2.0%
r                    1987     1.5%
u                    1849     1.4%
m                    1769     1.3%
-                    620      0.5%
1                    542      0.4%
9                    460      0.3%
f                    416      0.3%
Other values (21)    2377     1.8%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and (positive) scale of the two variables, which implies that for a linear function the steepness of the line, that is, its angle to the x-axis, does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
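
The calculation described above can be sketched in plain Python. This is a minimal illustration of the definition (covariance divided by the product of the standard deviations), not the profiler's own implementation; the function name `pearson_r` and the sample data are made up for the example.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Population covariance and standard deviations; the 1/n factors cancel
    # in the ratio, so a sample (n - 1) convention would give the same r.
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = math.sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = math.sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y_up = [2 * v + 1 for v in x]      # increasing linear function of x
y_down = [-3 * v + 10 for v in x]  # decreasing linear function of x

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)
print(r_up, r_down)
```

Both example series are exact linear functions of x, so r comes out at +1 and -1 (up to floating-point error) regardless of the slope's steepness.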

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
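
As a sketch of that recipe, the code below replaces each value by its rank and then applies the Pearson formula to the ranks. Ties receive the average of the ranks they span, which is a common convention but an assumption here; the helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative.

```python
import math

def average_ranks(values):
    """1-based ranks; tied values share the average of their rank positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to values[order[i]].
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman_rho(x, y):
    """Pearson correlation computed on the rank variables of x and y."""
    return pearson_r(average_ranks(x), average_ranks(y))

x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [v ** 3 for v in x]  # monotonic but nonlinear in x

rho = spearman_rho(x, y)
r_linear = pearson_r(x, y)
print(rho, r_linear)
```

Because y increases whenever x does, the ranks coincide and ρ is 1, while Pearson's r on the raw values stays below 1.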

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation.

To calculate τ for two variables X and Y, one counts the concordant and discordant pairs of observations. τ is then the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
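
The pair-counting definition above corresponds to the variant usually called tau-a. The sketch below implements exactly that formula; it is an illustration only, and library routines (for example `scipy.stats.kendalltau`, which defaults to the tie-adjusted tau-b) may return different values when the data contain ties. The function name `kendall_tau_a` is made up for the example.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant - discordant) / total number of pairs; tied pairs count as neither."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        sign = (x1 - x2) * (y1 - y2)
        if sign > 0:
            concordant += 1   # both variables order the pair the same way
        elif sign < 0:
            discordant += 1   # the variables order the pair in opposite ways
    n = len(x)
    total_pairs = n * (n - 1) / 2
    return (concordant - discordant) / total_pairs

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]  # one swapped pair relative to x

tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, giving τ = 4/6.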

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available from the Phik project.

Missing values

Sample

First rows

(index) | regionId | label | referenceDate | lastUpdatedDate | totalDeaths | totalConfirmedCases | totalRecoveredCases | totalTestedCases | numPositiveTests | numDeaths | numRecoveredCases | diffNumPositiveTests | diffNumDeaths | avgWeeklyDeaths | avgWeeklyConfirmedCases | avgWeeklyRecoveredCases | dataSource
0 | 29001,_Alcolu,_South_Carolina,_South_Carolina | 29001, Alcolu, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 12.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.00 | NaN | https://www.scdhec.gov/
1 | 29003,_Bamberg,_South_Carolina,_South_Carolina | 29003, Bamberg, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 12.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.43 | NaN | https://www.scdhec.gov/
2 | 29009,_Bethune,_South_Carolina,_South_Carolina | 29009, Bethune, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 11.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.29 | NaN | https://www.scdhec.gov/
3 | 29010,_Bishopville,_South_Carolina,_South_Carolina | 29010, Bishopville, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 197.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.57 | NaN | https://www.scdhec.gov/
4 | 29016,_Blythewood,_South_Carolina,_South_Carolina | 29016, Blythewood, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 81.0 | NaN | NaN | 0.0 | NaN | NaN | -3.0 | NaN | NaN | 1.57 | NaN | https://www.scdhec.gov/
5 | 29018,_Bowman,_South_Carolina,_South_Carolina | 29018, Bowman, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 6.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.14 | NaN | https://www.scdhec.gov/
6 | 29020,_Camden_station_(South_Carolina),_South_Carolina | 29020, Camden station (South Carolina), South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 160.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 1.14 | NaN | https://www.scdhec.gov/
7 | 29030,_Cameron,_South_Carolina,_South_Carolina | 29030, Cameron, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 7.0 | NaN | NaN | 0.0 | NaN | NaN | -1.0 | NaN | NaN | 0.43 | NaN | https://www.scdhec.gov/
8 | 29031,_Carlisle,_South_Carolina,_South_Carolina | 29031, Carlisle, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 2.0 | NaN | NaN | 0.0 | NaN | NaN | 0.0 | NaN | NaN | 0.00 | NaN | https://www.scdhec.gov/
9 | 29033,_Cayce,_South_Carolina,_South_Carolina | 29033, Cayce, South Carolina, South Carolina | 2020-06-09 | 2020-06-09T04:00:00.000Z | NaN | 28.0 | NaN | NaN | 0.0 | NaN | NaN | -5.0 | NaN | NaN | 1.86 | NaN | https://www.scdhec.gov/

Last rows

(index) | regionId | label | referenceDate | lastUpdatedDate | totalDeaths | totalConfirmedCases | totalRecoveredCases | totalTestedCases | numPositiveTests | numDeaths | numRecoveredCases | diffNumPositiveTests | diffNumDeaths | avgWeeklyDeaths | avgWeeklyConfirmedCases | avgWeeklyRecoveredCases | dataSource
5124 | Yukon%E2%80%93Koyukuk_Census_Area,_Alaska | Yukon–Koyukuk Census Area, Alaska | 2020-06-09 | NaN | NaN | 1.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 | NaN | http://www.dhss.alaska.gov/
5125 | Yuma_County,_Arizona | Yuma County, Arizona | 2020-06-09 | 2020-06-09T15:00:00.000Z | 28.0 | 2378.0 | NaN | 17879.0 | 121.0 | 2.0 | 2.0 | -5.0 | 1.0 | 1.86 | 157.57 | NaN | https://azdhs.gov/
5126 | Yuma_County,_Colorado | Yuma County, Colorado | 2020-06-09 | NaN | NaN | 48.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 1.14 | NaN | https://covid19.colorado.gov/
5127 | Zabaykalsky_Krai | Zabaykalsky Krai | 2020-06-09 | 2020-06-09T07:35:00.000Z | 31.0 | 1634.0 | 942.0 | NaN | 83.0 | 2.0 | 80.0 | 3.0 | 0.0 | 1.43 | 67.00 | 52.29 | https://xn--80aesfpebagmfblc0a.xn--p1ai/#
5128 | Zambia | Zambia | 2020-06-09 | 2020-06-09T11:41:20.000Z | 10.0 | 1200.0 | NaN | NaN | NaN | 3.0 | NaN | NaN | 3.0 | 0.43 | 15.86 | NaN | https://www.ecdc.europa.eu/
5129 | Zapata_County,_Texas | Zapata County, Texas | 2020-06-09 | NaN | 0.0 | 12.0 | 8.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 | 0.29 | 0.14 | https://www.dshs.state.tx.us/
5130 | Zavala_County,_Texas | Zavala County, Texas | 2020-06-09 | NaN | 0.0 | 12.0 | 9.0 | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 | 0.00 | 0.29 | https://www.dshs.state.tx.us/
5131 | Ziebach_County,_South_Dakota | Ziebach County, South Dakota | 2020-06-09 | 2020-06-09T17:00:00.000Z | 0.0 | 2.0 | 1.0 | 83.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.00 | 0.14 | 0.00 | https://doh.sd.gov/
5132 | Zimbabwe | Zimbabwe | 2020-06-09 | 2020-06-09T11:41:20.000Z | 4.0 | 287.0 | NaN | NaN | NaN | 0.0 | NaN | NaN | 0.0 | 0.00 | 12.00 | NaN | https://www.ecdc.europa.eu/
5133 | Zwettl_District | Zwettl District | 2020-06-09 | 2020-06-08T20:00:00.000Z | NaN | 73.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | 0.00 | NaN | https://info.gesundheitsministerium.at/